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Optimal Per-Edge Processing Times in the Semi-Streaming Model* 
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We present semi-streaming algorithms for basic graph problems that have optimal per-edge processing times 
and therefore surpass all previous semi-streaming algorithms for these tasks. The semi-streaming model, which 
is appropriate when dealing with massive graphs, forbids random access to the input and restricts the memory to 
Oin ■ polylogn) bits. 

Particularly, the formerly best per-edge processing times for finding the connected components and a bipartition 
are C(a(n)), for determining fc-vertex and fc-edge connectivity C(fc 2 n) and 0(n-log n) respectively for any constant 
k and for computing a minimum spanning forest O(logn). All these time bounds we reduce to 0(1). 

Every presented algorithm determines a solution asymptotically as fast as the best corresponding algorithm 
up to date in the classical RAM model, which therefore cannot convert the advantage of unlimited memory and 
random access into superior computing times for these problems. 
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1. Introduction 

When facing computational tasks on massive 
graphs the postulate of the classical RAM model, 
that is, storing the whole input in memory allow- 
ing random access to it, is no longer adequate. 
In fact, information building up the graph may 
arrive at no specified order and the attempt of 
completely storing it exceeds common main mem- 
ories. Regarding this Muthukrishnan 8 J 2003 pro- 
posed the semi-streaming model as a more restric- 
tive model of computation. According to this the 
edges of the input graph G appear at arbitrary or- 
der and the memory is limited to 0(n- polylogn) 
bits, where n is the number of vertices in 67. An 
important parameter of a semi-streaming algo- 
rithm is described by the per-edge processing time 
T, i.e., the time the algorithm needs to handle 
each single edge. This time determines at which 
frequency the edges may arrive. The second pa- 
rameter of the semi-streaming model denotes the 
number P of passes the algorithm takes over the 
input stream. All considered algorithms in this 
paper use only one pass. 

Despite the heavy restrictions in the semi- 
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streaming model there are algorithms known solv- 
ing basic graph problems. In [3] semi-streaming 
algorithms are given for computing the connected 
components and a bipartition of a graph as well 
as a minimum spanning tree of a weighted graph. 
There are approaches to determine the k-edge 
connectivity [5] and the /c-vertex connectivity 
[5].[T3"] of a graph for any constant k. 

In this paper we present semi-streaming algo- 
rithms for computing the connected components 
and a bipartition of a graph, to calculate the k- 
vertex and fc-edge connectivity for any constant 
k and to find a minimum spanning forest MSF. 
All these algorithms have constant and therefore 
optimal per-edge processing times. 

Section [5] gives the usual definitions, in Section 
[31 we discuss our definition of the per-edge pro- 
cessing time which is a slight refinement of pre- 
vious definitions. We develop our semi-streaming 
algorithms in Section [4] In Section [5] we debate 
on how the obtained algorithms compete with the 
corresponding algorithms in the RAM model. A 
final conclusion is found in Section [5] 

2. Preliminaries and Definitions 

By G we denote a graph G(V, E) with vertex 
set V and edge set E. We call n = \V\ and 
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m = \E\ the number of vertices and edges re- 
spectively. Every graph considered in this paper 
is undirected and contains no loops but might 
have multiple edges. For computing an MSF we 
consider G to be a weighted graph, that is, with 
a nonnegative weight associated with each edge. 
Regarding the memory constraints of the semi- 
streaming model we assume every weight to be 
storable in O(polylogn) bits. 

We define a(m,n) to be a natural inverse of 
Ackermann's function A(-,-) as defined in [12] : 
a(m,n) :— min{i > 1 | A(i, \m/n\) > logn}. We 
abbreviate a(n) to denote a(n,n). 

Bipartition. A graph G is called bipartite if the 
vertices can be split in two parts, a bipartition, 
such that no edge runs between two vertices in the 
same parts. The problem of finding a bipartition 
is to find two such parts or stating that there is 
no bipartition since the graph is not bipartite. 

Connectivity. We name two vertices connected 
if there is a path between them. A graph G is con- 
nected if any pair of vertices in G is connected, a 
connected component of G is an induced subgraph 
C of G such that C is connected and maximal. A 
spanning forest of G is a subgraph of G with- 
out any cycles having the same connected com- 
ponents as G. Given a positive integer k, a graph 
G is said to be k-vertex connected (k-edge con- 
nected) if the removal of any k — 1 vertices (edges) 
leaves the graph connected. A subset S of the ver- 
tices (edges) of G we call an l-separator (I -cut) 
if I = \S\ and the graph obtained by removing 
S from G has more connected components than 
G. The local vertex-connectivity n{x,y\G) {local 
edge- connectivity X(x,y;G)) denotes the number 
of vertex-disjoint (edge-disjoint) paths between x 
and y in G. By a classical result of Menger (see 
e.g. |T]) the local vertex- (edge-) connectivity be- 
tween x and y equals the minimum number of 
vertices (edges) that must be removed to obtain 
x and y in different connected components. 

MSF/MST. For an edge- weighted graph G the 
minimum spanning forest MSF is a subgraph G' 
of G with minimum total cost consisting of the 
same connected components as G. If G is con- 



nected we name G' , which is then connected as 
well, the minimum spanning tree MST of G. 

Certificates. Given any graph property V and 
a graph G, a certificate of G for V is a graph G' 
on the same vertex set such that G has V if and 
only if G' has V . 

For any graph G on vertex set V and any prop- 
erty V a strong certificate of G for V is a graph 
G' on vertex set V such that for any graph H on 
V, G U H has V if and only if G' U H has V. 

A certificate is said to be sparse if the number 
of edges is 0(n). 

Semi-Streaming Algorithm. A graph stream 
of a graph G is a sequence of the m edges of G 
in arbitrary oder. A semi-streaming algorithm A 
gets a graph stream as an input and is restricted 
to use a space of at most 0(n ■ polylogn) bits. 
The algorithm may access the input stream for 
P passes in a sequential one-way order. All algo- 
rithms considered in this paper use only P = 1 
pass. The per-edge processing time T of A we de- 
fine to be the minimum time allowed between the 
revealing of two consecutive edges in the input 
stream. That definition of T renders the defini- 
tions of previous papers more precisely, we give 
a discussion concerning that in Section [3l There 
we also comment on the computing time which 
denotes the total time required by A to determine 
the property in question of the input graph. 

3. Discussion of Per-Edge Processing Time 

In previous papers about semi-streaming algo- 
rithms that consider the per-edge processing time 
T (|3].[5].[T3"]). T is used in an ambiguous way. 
While being used as the worst-case time to pro- 
cess a single edge on the one hand it is equally 
used on the other hand, even if not explicitly 
stated, as amortized time charged over the num- 
ber of edges. In fact, if tools as dynamic trees or 
disjoint set data structures are utilized they give 
rise to amortized times since their time bounds 
are of amortized type, too. Processing the input 
edges is then assumed to be evenly spread over 
the whole computing time which is just m ■ T. 

This definition is not appropriate for a stream- 



Optimal Per-Edge Processing Times in the Semi-Streaming Model 



3 



ing algorithm: As Muthukrishnan[5j pointed out 
the computing time, i.e., the time to evaluate the 
property in question for items read in so far, is 
not the most important parameter of a stream- 
ing algorithm. What is more crucial is the max- 
imum frequency of incoming items that can still 
be considered by the algorithm. That refers to 
the speed at which external storage devices can 
present their data content to a streaming algo- 
rithm and constitutes the frequency at which ob- 
served phenomena can be taken into account. To 
this aim it is desirable to maximize the possible 
rate of incoming items by postponing as much 
operations as possible to a point after which all 
items are received, possibly accepting a higher 
computing time. 

To model this worthwhile property of a stream- 
ing algorithm A we propose the definition of the 
per-edge processing time T to be the minimum 
allowable time between two consecutive edges in 
the graph stream. The final determination of the 
property in question may require some postpro- 
cessing after reading all input edges. This time is 
considered in the computing time which incorpo- 
rates the sum of the per-edge processing times of 
all edges and the postprocessing time. 

4. Computing Certificates and Buffering 
Edges 

To achieve our optimal per-edge processing 
times we exploit the general method of sparsifi- 
cation as presented by Eppstein et al.(3|. Feigen- 
baum et al.[5] pointed out how the results of [3] 
can be adopted for the semi-streaming model. 
Thus they received the formerly best bounds on T 
for almost all problems considered in this paper. 
We refine their method to obtain an improvement 
of their results. For a comparison of our new 
bounds with the previous ones see Table [TJ 

Due to the memory limitations of the semi- 
streaming model it is not possible to memorize 
a whole graph which is too dense, that is, if 
m/n 3> logn. A way to determine graph prop- 
erties without completely storing the graph is to 
find a sparse certificate C of the graph for the 
property in question. Consisting of a linear num- 



ber of edges the certificate can be stored within 
the memory restrictions and testing it answers 
the question for the original graph. The con- 
cept of certificates has been applied for the semi- 
streaming model in [5] and [13) . However, in [13] 
every input edge initiates an update of the certifi- 
cate which is time-consuming and avoids a faster 
per-edge processing. 

To increase the manageable frequency of in- 
coming edges, updating the certificate can be 
done not for every single edge but for a group of 
edges. While considering such a group of edges 
the next incoming edges can be buffered to com- 
pose the group for the following update. 

To permit this updating in groups of edges the 
utilized certificate must be a strong certificate, 
an assumption that is not required in [13j . That 
is because strong certificates obey two important 
attributes for any fixed graph property: Firstly, 
they behave transitively, that is, if C is a strong 
certificate for G and C is a strong certificate for 
C, then C is a strong certificate for G. Secondly, 
if G' and H' are strong certificates of G and H 
respectively, then G' U H' is a strong certificate 
of G U H. 

The technique of group-wise updating is used 
by Eppstein et al.[3] yielding fast dynamic al- 
gorithms and has been transferred to the semi- 
streaming model by Feigenbaum et al.pj]. The 
following theorem is a slightly extended version 
of their result augmented with space considera- 
tions. We will need details of the proof later on. 

Theorem 1 Let G be a graph and let C be a 
sparse and strong certificate of G for a graph 
property V . If C can be computed in space 0{m) 
and time f(n,m), then there is a one-pass semi- 
streaming algorithm building C of G with per-edge 
processing time T = f(n, 0(n))/n. 

Proof. We denote the edges of the input stream 
as e\ , e%, . ■ ■ , e m and the subgraph of G contain- 
ing the first i edges in the stream as Gi. We 
inductively assume that we computed a sparse 
and strong certificate Cj n of the graph Gj n for 
1 < j < [m/nj using a time of f(n,0(n))/n 
per already processed edge. During the com- 
putation of Cj n we buffered the next n edges 

Cjn+lj Gjn+2j • • • j e (j+l)n' 
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Table 1 



Previously best per-edge processing times T compared to our new bounds 



Problem 


Previous Best T 


New T 


Connected components 


0(a(n)) 


0(1) 


Bipartition 


0(a(n)) 


0(1) 


{2,3}-vertex connectivity 


0{a{n)) 


0(1) 


4- vertex connectivity 


O(logn) 


0(1) 


k- vertex connectivity 


0{k 2 n) 


0(1) 


{2,3}-edge connectivity 


0(a(n)) 


0(1) 


4-edge connectivity 


0(na(n)) 


0(1) 


k-edge connectivity 


0(n ■ logn) 


0(1) 


Minimum spanning forest 


O(logn) 


0(1) 



All previous bounds are due to [5], apart from k- vertex connectivity which is a result of .13]. k is any 
constant, a(n) the inverse of Ackermann's function. 



Because of the properties of strong certificates 
T = C jn U {ej n+ i, ejn+2, ■ ■ ■ , e( i+ i)„} is a strong 
certificate for G( J+1 )„. Since Cj n is sparse, T con- 
sists of 0(n) edges as well. Computing CV,- +1 ) n 
as a sparse and strong certificate of T can be re- 
alized in a space linear in the space needed to 
memorize the edges of T, which is 0(n -polylogn) 
bits, without exceeding the memory limitation 
of the semi-streaming model. By transitivity 
C(j_|_i)„ is a strong certificate of G(j+x)n- A time 
of f(n,0(n)) suffices to compute C7y + i) n , hence 
the input edges can arrive with a time delay of 
f(n,0(n))/n building the group of the next n 
edges to update the certificate after the compu- 
tation of C(j + i\ n is completed. 

Finally for k — [m/n\ the last group of edges 
{efc, l+ i,e fc „ +2 , . . . ,e m } can simply be added to 
Ck n to obtain a sparse and strong certificate of 
the input graph G for the property V . □ 

To obtain our semi-streaming algorithms with op- 
timal per-edge processing times, all that remains 
to do is to present the required certificates and to 
show in which time and space bounds they can 
be computed. At first glance it may seem sur- 
prising that Feigenbaum et al.[5] using the same 
technique of updating certificates with groups of 
edges do not meet the bounds we present in this 
paper. The reason is that they just observe that 
results of Eppstein et al.[3j can be transfered to 



the semi-streaming model. However, Eppstein et 
al. develop dynamic graph algorithms requiring 
powerful abilities: The algorithm must be able to 
answer a query for the subgraph of already read 
edges at any time and it must handle edge dele- 
tions. In the semi-streaming model the property 
is queried only at the end of the stream and there 
are no edge deletions. Thus we can drop both re- 
quirements for faster per-edge processing times. 

In the following the input graph for our semi- 
streaming algorithms is denoted by G with n ver- 
tices and m edges as usual. 

4.1. Connected Components 

We use a spanning forest F of G as a certificate. 
F is not only a strong certificate for connectiv- 
ity it also has the same connected components as 
G. F can be computed by a depth-first search in 
time and space of 0(n + m) and is sparse by defi- 
nition. Using Theorem[T]we get a semi-streaming 
algorithm computing a spanning forest of G with 
per-edge processing time T — 0(1). To identify 
the connected components of G in the postpro- 
cessing step we can run a depth-first search on 
the final certificate in time 0(n). The resulting 
computing time is m ■ T + O(n) — 0(n + m). 

4.2. Bipartition 

As a certificate for bipartiteness of G we use 
F + , which is a spanning forest of G augmented 
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with one more edge of G inducing an odd cycle if 
there is any. If no such cycle exists F + is just a 
spanning forest. By [3] F + is a strong certificate 
of G and sparse by definition. It can be com- 
puted by a depth-first search which is alternately 
coloring the visited vertices and is therefore able 
to find an odd cycle. To do so a time and space 
of 0(n + to) suffices, yielding a semi-streaming 
algorithm with T = 0(1). On the final certificate 
we can run again a depth-first search coloring the 
vertices alternately in time 0(n) during the post- 
processing step. That produces a bipartition of 
the vertices or identifies an odd cycle in G in a 
computing time of 0(n + to). 

4.3. k- Vertex Connectivity 

For fc-vertex connectivity, k being any con- 
stant, we use as a certificate of G a subgraph Ck 
which is derived by an algorithm presented by 
Nagamochi and Ibarakl9 . Ck can be computed 
in time and space of 0(n + to), contains at most 
kn edges and is therefore sparse. Beyond it, as 
a main result of [9] Ck preserves the local vertex 
connectivity up to k for any pair of nodes in G: 

K(x,y;C k ) >min{K(x,y;G),k} Vx,yeV (1) 

This quality of Ck leads to useful properties: 

Lemma 2 Every l-separator S in Ck, I < k, is 
an l-separator in G and its removal leaves the 
same connected components in both Ck \ S and 
G\S. 

Proof. In Cfe \ S we find two nonempty, disjoint 
connected components X and Y with vertices x G 
X and y 6 Y . Assume that S is not an /-separator 
in G, therefore there exists a path Z from x to y 
in G\S. Let x' be the last vertex on Z in X and y 1 
the first one in Y . The part of Z between x' and y 1 
we call Z' . In Ck we find at most I vertex-disjoint 
paths between x' and y' , all of them using vertices 
of S. In G these paths exist as well with the 
additional path Z' which is vertex-disjoint from 
the other paths by construction. Therefore the 
local connectivity between x' and y' in G exceeds 
that in Ck contradicting property Q] of Ck- 

Since Ck \ S is a subgraph of G \ S every con- 
nected component of Ck\S is included in one con- 
nected component of G \ S. Assume that W is a 



connected component in G\S which contains two 
vertices i and j within different connected com- 
ponents of Ck \ S, namely I 3 i and J 3 j. As in 
the first part of this proof we can find a path Z 
from i to j in W with x' being the last vertex in 
I and y' the first one in J on Z. We can deduce 
the same contradiction as above. □ 

So Ck is usable for our purposes: 

Lemma 3 Ck is a strong certificate for k-vertex 
connectivity ofG. 

Proof. If CfcUTJ is k- vertex connected then GUH 
including Ck U H as a subgraph is k- vertex con- 
nected as well. Assume for the proof of the con- 
verse direction that G U H is fc-vertex connected 
and Ck U H is not. Then U H contains an l- 
separator S for some I < k. After the removal of S 
the remaining vertices of Ck U H can be grouped 
into two nonempty sets A and B, such that no 
edge joins a vertex of A with a vertex of B. It 
is immediate that H does not contain any edges 
between A and B. 

Clearly, removing S from Ck produces the same 
sets A and B, still with no edge joining them. The 
properties of Ck shown in Lemma [2] make sure 
that the removal of S from G leaves A and B 
without any joining edge, too. With H having no 
edges between A and B the graph G U H cannot 
be A:-vertex connected. □ 

Using Theorem [T] yields a semi-streaming algo- 
rithm computing a sparse and strong certificate of 
fc-vertex connectivity in per-edge processing time 
T = 0(1). To test the final certificate for k- 
vertex connectivity in a postprocessing step we 
can use an algorithm of Gabow[7 on it. That al- 
gorithm runs in time 0((fc 5 / 2 + n)kn) = 0(kn 2 ) 
and, what is more important, uses a space lin- 
ear in the number of edges of the final certificate, 
hence is respecting the memory constraints of the 
semi-streaming model. The resulting computing 
time is 0(m + kn 2 ). 

4.4. /c-Edge Connectivity 

We use the same Ck as utilized in Section l4~3l 
produced by the algorithm of Nagamochi and 
Ibaraki presented in [}J], where it is shown that 
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Ck reflects the local edge-connectivity of G in the 
following way: 

\(x,y;C k )>mm{\(x,y;G),k} \fx,y eV (2) 

Therefore Lemma [2] and Lemma [3] can be formu- 
lated and proven with respect to Z-cuts, I < k, 
and fc-edge connectivity. Accordingly we have 
a semi-streaming algorithm computing a strong 
and sparse certificate for fc-edge connectivity us- 
ing T — 0(1). To determine fc-edge connectivity 
of the final certificate we can use an algorithm of 
GabowJB] using a space linear in the number of 
edges of the final certificate. It takes a time of 
0(m + fc 2 nlog(n/fc)) which is also the resulting 
computing time of our semi-streaming algorithm. 

4.5. Minimum Spanning Forest 

Let us first take a look at the algorithm we 
use as a subroutine for our semi-streaming al- 
gorithm computing an MSF of a given graph. 
We utilize the MST algorithm of Pettie and 
Ramachandran [TTj which uses a space of 0(m). 
A remark on how we use an algorithm comput- 
ing an MST to obtain an MSF we give below. 
The algorithm of [UJ uses a time of 0(T* (m, n)), 
where T*(m,n) denotes the minimum number of 
edge-weight comparisons needed to find an MST 
of a graph with n vertices and m edges. The algo- 
rithm uses decision trees which are provably opti- 
mal but whose exact depth is unknown. Because 
of that the exact running time of the algorithm is 
not known even it is optimal. 

The currently tightest time bound for the MST 
problem is given by algorithms due to Chazelle[2 
and Pettie [TO] that run in time 0(m ■ a(rn,n)). 
Consequently the optimal algorithm of Pettie and 
Ramachandran [TJJ inherits this running time, 
T*(m,n) = 0(m ■ a(m,n)). Based on the def- 
inition a(m, n) — 0(1) if m/n > logn. Therefore 
on a sufficiently dense graph the algorithm of [TTJ 
computes an MST in time 0(m). 

Using this optimal algorithm as our subroutine 
we can find a semi-streaming algorithm with per- 
edge processing time T — 0(1) in the following 
way. We use the technique described in Theorem 
[TJ of merging a computed subgraph with buffered 
edges and then calculating a new subgraph of the 
merged graph while buffering the next group of 



edges. Unlike before we use groups of edges con- 
sisting of r = n ■ logn edges instead of n. Such a 
number of edges can be memorized in the semi- 
streaming model using 0(n ■ polylogn) bits, even 
if weights are assigned to the edges which we as- 
sume to be storable in 0(polylogn) bits each. 

By taking up the notation of the proof of The- 
orem [TJ Cj r is the memorized MSF of the graph 
Gj r made up of the edges ei, e2, . . . , e jr . We 
merge the buffered next r edges with Cj r to ob- 
tain T — Cj r \j{ej r +i, ej r +2, ■ ■ ■ , e (j+l)r}- F° r t ne 

number mj of edges in T we have ttit > n ■ log n 
and therefore the optimal MST algorithm uses a 
time of 0(ttit) to compute the MSF 0(j+i) r of T. 
Since my < 2r the computation of C(j+-±\ r takes 
a time of 0(r). To fill the buffer of the next r 
edges in the meantime, the edges can arrive with 
a time delay of 0(1). 

It remains to show that what we compute in 
the described way is indeed an MSF of the in- 
put graph G. Every edge of Gj r that is not in 
Cj r is the heaviest on a cycle in Gj r and cannot 
be in an MSF of Gj r . On the other hand Cj r 
does not contain any dispensable edges since it 
includes no cycles: The removal of any edge from 
Cj r produces two connected components in Cj r 
whose vertices form a common connected com- 
ponent in Gjr- Therefore Cj r forms an MSF of 
Gjr , inductively showing that we really obtain an 
MSF of G in this manner. 

Now we can state the computing time of our 
semi-streaming algorithm which depends on the 
density of the input graph G. If G has at most 
r = nTogn edges, all edges are read and buffered 
in time 0(m) and then the optimal algorithm 
of Pettie and Ramachandran TJJ computes an 
MSF in time 0(T*(m, n)), producing a comput- 
ing time of 0(T*(m, n)), since f2(m) is a lower 
bound for T*(m,n). 

If G has more than r edges we successively up- 
date an MSF with groups of edges. Note that, dif- 
ferent from the described procedure in the proof 
of Thcorcm[TJ the last group of edges is not simply 
merged to the up to now computed Gy m / r \r- In- 
stead the MSF of the merged graph is calculated 
to obtain the final MSF, which is also the MSF of 
the input graph, in the postprocessing step. We 
can fill the last group of edges up to a complete 
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group of r edges by using dummy edges weighted 
heavier than any edge in the input stream. This 
way we ensure that the last merged graph for the 
postprocessing with m/ > r edges is sufficiently 
dense for the optimal MST algorithm running 
on it. So for the postprocessing time we have 
0(T*(m f ,n)) = 0(m f ■ a(m f ,n)) = 0(m f ). 
Therefore the computing time is 0(m)+0(m/) = 
0(m), which is trivially 0(T*(m, n)). 

Let us give two minor remarks about the algo- 
rithm of Pettie and Ramachandran[Tl] we use. 
Firstly, the algorithm of [11 assumes the edge 
weights to be distinct. We do not require that 
property since ties can be broken while reading 
the input edges in a way described in [3]. Sec- 
ondly, the algorithm of [11] works on connected 
graphs. Before running it, we can use a depth- 
first search to identify the connected components 
which are then processed separately. Identify- 
ing the connected components takes a time of 
(D(m) = C(T*(m, n)), so the running time of our 
subroutine persists as well as the per-edge pro- 
cessing time of our semi-streaming algorithm. 

5. Discussion 

In this section we compare the obtained semi- 
streaming algorithms to algorithms determining 
the same properties in the classical RAM model 
allowing random access to all the edges of a graph 
without any memory constraints. 

First note that the presented semi-streaming 
algorithms have optimal per-edge processing 
times, that is, no semi-streaming algorithm ex- 
ists allowing asymptotically shorter times: Every 
single edge must be considered to determine a so- 
lution for the problems considered in this paper, 
so a time of per edge is a lower bound for 
these problems. 

Let us now take a look at the presented semi- 
streaming algorithms testing fc-vertex and fc-edge 
connectivity. For k- vertex connectivity with k be- 
ing a constant the fastest algorithm in the RAM 
model to date is due to Gabow[7j which runs 
in 0(kn 2 ). Gabow obtains this result even in 
graphs with multiple edges by preprocessing the 
input graph with the algorithm of Nagamochi 



and Ibaraki[9] in time 0{m) producing a running 
time of 0(kn 2 + m) on graphs and multigraphs. 
This asymptotically equals our computing time, 
which is not surprising since we use Gabow's al- 
gorithm as our subroutine. The same situation 
we find when looking at fc-edge connectivity. Our 
achieved computing time of 0(ra + k 2 n log(n/fe)) 
is asymptotically as fast as the fastest algorithm 
in the RAM model due to Gabow [5] which we use 
as a subroutine. So both our connectivity algo- 
rithms have a computing time that is asymptoti- 
cally the same as the fastest known corresponding 
algorithms in the RAM model. 

It is possible that there are faster but still un- 
known algorithms in the RAM model for k- vertex 
and fc-edge connectivity which cannot be utilized 
in the semi-streaming model because they con- 
sume to much space. The converse is true for the 
problems of finding connected components, a bi- 
partition and an MSF of a given graph. The pre- 
sented semi-streaming algorithms have asymptot- 
ically the same computing time as the fastest pos- 
sible algorithms in the RAM-model. That can 
easily be seen for connected components and bi- 
partition: We obtain in each case a computing 
time of 0(n + m) which is trivially a lower bound 
for any algorithm in the RAM model solving these 
problems. For computing an MSF we get a com- 
puting time of 0(T*(m,n)), where T*(m,n) is 
the lower time bound for any RAM algorithm. 

For the asymptotic time needed to determine a 
solution there is no difference for /c-edge and k- 
vertex connectivity between the currently fastest 
algorithms in the RAM model and the presented 
semi-streaming algorithms. Unless faster con- 
nectivity algorithms in the RAM model are de- 
veloped there is no demand for a random ac- 
cess to the edges and for a memory exceeding 
0(n ■ polylogn) bits. For computing the con- 
nected components, a bipartition and an MSF 
such a demand will never emerge since the pre- 
sented semi-streaming algorithms have optimal 
computing times. The RAM model cannot capi- 
talize on its mighty potential of unlimited mem- 
ory and random access to beat the computing 
times of the weaker semi-streaming model. 

We close this section by indicating a tradeoff be- 
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tween memory and time when computing an MSF 
in the semi-streaming model. If the memory con- 
straint of the semi-streaming algorithm is reduced 
from 0(n ■ polylogn) to 0(n ■ log 2 ~ £ n) bits, only 
s = o(n ■ log n) edges can be memorized. So 
the optimal MST algorithm we use as a subrou- 
tine needs a time of 0(T* (s,n)). Provided that 
T*(s,n) = lo(s) we obtain a per-edge processing 
time of to(l) and therefore a computing time of 
a; (to). Both bounds are significantly larger than 
the corresponding ones when 0(n- polylogn) bits 
of memory are permitted. However, if it turns 
out that T*(to, n) = 0(m) for any to, it suffices 
to store O(n) edges to obtain both optimal per- 
edge and computing time in the semi-streaming 
model. 

6. Conclusion 

We presented semi-streaming algorithms for 
computing the connected components, a bipar- 
tition, the /c-vertex and fc-edge connectivity for 
any constant k and an MSF of a given graph. 
The presented per-edge processing times T sur- 
pass former semi-streaming algorithms and are 
optimal because they are constant. All intro- 
duced semi-streaming algorithms are asymptot- 
ically as fast as the fastest corresponding algo- 
rithms in the RAM model. For connected com- 
ponents, bipartition and MSF we actually achieve 
the time bounds of the best possible RAM algo- 
rithms. 

The main idea for our semi-streaming algo- 
rithms is quite simple: A sparse memorized sub- 
graph is merged with buffered edges and while 
computing a sparse subgraph of the merged one 
the next edges are buffered. We believe this 
idea to be fruitful for other graph problems as 
well when tackling them without random access 
and within the memory constraints of the semi- 
streaming model. 
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